Extraction of Temporal Networks from Term Co-Occurrences in Online Textual Sources

Authors

  • Marko Popovic
  • Hrvoje Stefancic
  • Borut Sluban
  • Petra Kralj Novak
  • Miha Grcar
  • Igor Mozetic
  • Michelangelo Puliga
  • Vinko Zlatic
Abstract

A stream of unstructured news can be a valuable source of hidden relations between different entities, such as financial institutions, countries, or persons. We present an approach to continuously collect online news, recognize the relevant entities in it, and extract time-varying networks. The nodes of the network are the entities, and the links are their co-occurrences. We present a method to estimate the significance of co-occurrences, and a benchmark model against which their robustness is evaluated. The approach is applied to a large set of financial news, collected over a period of two years. The entities we consider are 50 countries which issue sovereign bonds, and which are in turn insured by Credit Default Swaps (CDS). We compare the country co-occurrence networks to the CDS networks constructed from the correlations between the CDS. The results show a relatively small but significant overlap between the networks extracted from the news and those from the CDS correlations.
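The network construction described in the abstract (entities as nodes, co-occurrences as links) can be illustrated with a minimal sketch for a single time window. Everything here is an assumption made for illustration: the function names, the toy documents, and the whitespace tokenization are hypothetical, and the comparison against an independence baseline is a simple stand-in, not the significance method or benchmark model of the paper.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_network(docs, entities):
    """Count entity mentions and pairwise co-occurrences per document.

    Assumes each entity is a single lowercase token; real entity
    recognition would be considerably more involved.
    """
    entity_set = set(entities)
    node_counts = Counter()  # number of documents mentioning each entity
    edge_counts = Counter()  # number of documents mentioning both entities of a pair
    for text in docs:
        tokens = set(text.lower().split())
        present = sorted(entity_set & tokens)  # sorted, so pairs have a canonical order
        node_counts.update(present)
        edge_counts.update(combinations(present, 2))
    return node_counts, edge_counts


def expected_under_independence(node_counts, n_docs, a, b):
    """Expected co-occurrence count if a and b were mentioned independently."""
    return node_counts[a] * node_counts[b] / n_docs


# Toy documents standing in for one time window of news (hypothetical data).
docs = [
    "greece and germany discuss bond yields",
    "germany reports growth",
    "greece germany spain face pressure",
    "spain issues new bonds",
]
nodes, edges = cooccurrence_network(docs, ["greece", "germany", "spain"])

for (a, b), observed in sorted(edges.items()):
    expected = expected_under_independence(nodes, len(docs), a, b)
    print(f"{a}-{b}: observed={observed}, expected={expected:.2f}")
```

Repeating the same count over consecutive time windows would give a sequence of networks, which is one simple way to read "time-varying" here. A pair observed more often than the independence baseline suggests is only a candidate link; deciding whether the excess is significant needs a proper statistical test.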


Similar resources

TimeTrails: A System for Exploring Spatio-Temporal Information in Documents

Information Extraction: a lot of information is published only in unstructured format, i.e. textual documents. Spatial and Temporal Information: widely spread in text documents; can be extracted and normalized; useful for search and exploration tasks. Events: happen at a specific place and time; space and time as two dimensions of events; co-occurrences of spatial and temporal expressions form events. Doc...


Multilingual Artificial Text Extraction and Script Identification from Video Images

This work presents a system for extraction and script identification of multilingual artificial text appearing in video images. As opposed to most of the existing text extraction systems which target textual occurrences in a particular script or language, we have proposed a generic multilingual text extraction system that relies on a combination of unsupervised and supervised techniques. The un...


Structure of ethnic violence in Sudan: a semi-automated network analysis of online news (2003-2010)

Mining textual sources of data can be used to design studies and test theories at temporal and spatial scales unheard of in the past. This opens up new opportunities for conflict studies and ethnographic research. We conducted a semi-automated network analysis of the 2003–2010 Sudan Tribune online news articles and modeled ethnic-group conflict in Sudan. We tested whether an ethnic group’s conn...


A Bayesian Sampling Method for Product Feature Extraction from Large Scale Textual Data

The authors of this work propose an algorithm that determines optimal search keyword combinations for querying online product data sources in order to minimize identification errors during the product feature extraction process. Data-driven product design methodologies based on acquiring and mining online product-feature-related data are faced with two fundamental challenges: 1) determining opt...


Spatio-Temporal, Mineralogy and Micro-Morphology of Dust Occurrences and Centers with Internal Sources in the Khouzestan Province

Extended abstract. 1. Introduction. Dust occurrences as natural events are common in arid, semi-arid and desert areas. Investigation of the dust with internal sources in the Khuzestan province including about 15 percent of the dust events coming to the region and the presence of the annual average of 50 times of the internal dust (with the concentration maximum of PM10 particles more than 8000p...



Journal title:

Volume 9, Issue

Pages -

Publication date: 2014